Let's Dance: Learning From Online Dance Videos
نویسندگان
چکیده
In recent years, deep neural network approaches have naturally extended to the video domain, in their simplest case by aggregating per-frame classifications as a baseline for action recognition. A majority of the work in this area extends from the imaging domain, leading to visual-feature heavy approaches on temporal data. To address this issue we introduce “Let’s Dance”, a 1000 video dataset (and growing) comprised of 10 visually overlapping dance categories that require motion for their classification. We stress the important of human motion as a key distinguisher in our work given that, as we show in this work, visual information is not sufficient to classify motion-heavy categories. We compare our datasets’ performance using imaging techniques with UCF-101 and demonstrate this inherent difficulty. We present a comparison of numerous state-of-theart techniques on our dataset using three different representations (video, optical flow and multi-person pose data) in order to analyze these approaches. We discuss the motion parameterization of each of them and their value in learning to categorize online dance videos. Lastly, we release this dataset (and its three representations) for the research community to use.
منابع مشابه
Modeling and Annotating the Expressive Semantics of Dance Videos
Dance videos are interesting and semantics-intensive. At the same time, they are the complex type of videos compared to all other types such as sports, news and movie videos. In fact, dance video is the one which is less explored by the researchers across the globe. Dance videos exhibit rich semantics such as macro features and micro features and can be classified into several types. Hence, the...
متن کاملSemantic Modeling and Retrieval of Dance Video Annotations
Dance video is one of the important types of narrative videos with semantic rich content. This paper proposes a new meta model, Dance Video Content Model (DVCM) to represent the expressive semantics of the dance videos at multiple granularity levels. The DVCM is designed based on the concepts such as video, shot, segment, event and object, which are the components of MPEG-7 MDS. This paper intr...
متن کاملInduction The dance of shadows
Statistics stems out from induction. Induction is a long lasting notion in philosophy. The nature of notions in philosophy are such that neither they can be solved completely nor one can leave them forever. One of the most important problem in induction is “the problem of induction”. In this paper we give a short history of induction and discuss some aspects of the problem of induction.
متن کاملHigh aspirations: Transforming dance students from print consumers to digital producers
During 2012, the Dance Department at the University of Surrey developed a set of Open Educational Resources with a Creative Commons license (Attribution, NonCommercial, Share Alike) for dance studies as part of the JISC-funded project Contexts, Culture and Creativity: Enriching E-learning in Dance (CCC:EED) see http://contextscultureandcreativity.wordpress.com/ for details. These OERs exemplify...
متن کاملRecognizing Induced Emotions of Happiness and Sadness from Dance Movement
Recent research revealed that emotional content can be successfully decoded from human dance movement. Most previous studies made use of videos of actors or dancers portraying emotions through choreography. The current study applies emotion induction techniques and free movement in order to examine the recognition of emotional content from dance. Observers (N = 30) watched a set of silent video...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1801.07388 شماره
صفحات -
تاریخ انتشار 2018